Abstract. A grammar model for concurrent, object-oriented
natural language parsing is introduced. Complete
lexical distribution of grammatical knowledge is achieved 
building upon the head-oriented notions of valency and 
dependency, while inheritance mechanisms are used to 
capture lexical generalizations. The underlying concurrent 
computation model relies upon the actor paradigm. We 
consider message passing protocols for establishing de- 
pendency relations and ambiguity handling. 

1 INTRODUCTION 

In this paper, we propose a grammar model that combines 
lexical organization of grammatical knowledge with lexi- 
calized control of the corresponding parser in an object- 
oriented specification framework. Recent developments in 
the field of linguistic grammar theory have already yielded 
a rigid lexical modularization. This fine-grained decompo- 
sition of linguistic knowledge can be taken as a starting 
point for lexicalized control. Current lexicalized grammars 
(for instance, HPSG: Pollard & Sag, 1987; CG: Hepple, 
1992; Lexicalized TAG: Schabes, Abeille & Joshi, 1988), 
however, still consider lexical items as passive data con- 
tainers whose content is uniformly interpreted by global 
control mechanisms (e.g., unification, functional composi- 
tion, tree adjunction). Diverging from these premises, we 
assign full procedural autonomy to lexical units and treat 
them as active lexical processes communicating with each 
other by message passing. Thus, they dynamically establish
heterogeneous communication lines in order to determine
each lexical item's functional role. While the issue of
lexicalized control was investigated early on in the paradigm
of conceptual parsing (Riesbeck & Schank, 1978),
and word expert parsing in particular (Small & Rieger, 
1982), these proposals are limited in several ways. First, 
they do not provide any general mechanism for the sys- 
tematic incorporation of grammatical knowledge. Second, 
they do not supply any organizing facility to formulate 
generalizations over sets of lexical items. Third, lexical 
communication is based on an entirely informal protocol 
that lacks any grounding in principles of distributed com- 
puting. 

We intend to remedy these methodological shortcom- 
ings by designing a radically lexicalized grammar on the 
basis of valency and dependency (these head-oriented 
notions already figure in different shapes in many modern 
linguistic theories, e.g., as subcategorizations, case frames, 
theta roles), by introducing inheritance as a major organizational
mechanism (for a survey of applying inheritance
in modern grammar theory, cf. Daelemans, De Smedt &
Gazdar, 1992), and by specifying a message passing proto- 
col that is grounded in the actor computation model (Agha 
& Hewitt, 1987). As this protocol allows for asynchronous 
message passing, concurrency enters as a theoretical 
notion at the level of grammar specification, not only as an 
implementational feature. The ParseTalk model outlined 
in this paper can therefore be considered as an attempt to 
replace the static, global-control paradigm of natural lan- 
guage processing by a dynamic, local-control model. 

The design of such a grammar and its associated parser 
responds to the demands of complex language perfor- 
mance problems. By this, we mean understanding tasks, 
such as large-scale text or speech understanding, which 
not only require considerable portions of grammatical 
knowledge but also a vast amount of so-called non-lin- 
guistic, e.g., domain and discourse knowledge. A major 
problem then relates to the interaction of the different 
knowledge sources involved, an issue that is not so press- 
ing when monolithic grammar knowledge essentially boils 
down to syntactic regularities. Instead of subscribing to 
any serial model of control, we build upon evidence from
computational text understanding studies (Granger, Eiselt 
& Holbrook, 1986; Yu & Simmons, 1990) as well as psy- 
cholinguistic experiments, in particular those worked out 
for the class of interactive language processing models 
(Marslen-Wilson & Tyler, 1980; Thibadeau, Just & Carpenter,
1982). They reveal that various knowledge sources
are accessed in an a priori unpredictable order and that a 
significant amount of parallel processing occurs at various 
stages of the (human) language processor. Therefore, com- 
putationally and cognitively plausible models of natural 
language understanding should account for parallelism at 
the theoretical level of language description. Currently, 
ParseTalk provides a specification platform for computational
language performance modeling. 1 In the future, this
vehicle can be used as a testbed for the configuration of 
cognitively adequate parsers. Moving performance considerations
to the level of grammar design is thus in strong
contrast to any competence-based account which assigns
structural well-formedness conditions to the grammar
level and leaves their computation to (general-purpose)
parsing algorithms, often at the cost of vast amounts of
ambiguous structural descriptions.

1 We only mention that performance issues become even more pressing
when natural language understanding tasks are placed in real-world
environments and thus additional complexity is added by ungrammatical
natural language input, noisy data, as well as lexical, grammatical,
and conceptual specification gaps. In these cases, not only do multiple
knowledge sources have to be balanced, but additional processing
strategies must be supplied to cope with these phenomena in a robust way.
This places extra requirements on the integration of procedural linguistic
knowledge within a performance-oriented language analysis framework,
viz. strategic knowledge of how to handle incomplete or faulty
language data and grammar specifications.

2 ParseTalk's CONCEPTUAL FRAMEWORK 

The ParseTalk model is based on a fully lexicalized gram- 
mar. Grammatical specifications are given in the format of 
valency constraints attached to each lexical unit, on which 
the computation of concrete dependency relations is 
based. By way of inheritance the entire collection of lexi- 
cal items is organized in lexical hierarchies (these consti- 
tute the lexical grammar), the lexical items forming their 
leaves and the intermediary nodes representing grammati- 
cal generalizations in terms of word classes. This specifi- 
cation is similar to various proposals currently investi- 
gated within the unification grammar community (Evans 
& Gazdar, 1990). The concurrent computation model 
builds upon and extends the formal foundations of the 
actor model, a theory of object-oriented computation that 
is based on asynchronous message passing. 

2.1 The Grammar Model 

The grammar model underlying the ParseTalk approach 
considers dependency relations between words as the fundamental
notion of linguistic analysis. A modifier is said to
depend on its head if the modifier's occurrence is permitted
by the head but not vice versa. 2 Dependencies are thus
asymmetric binary relations that can be established by
local computations involving only two lexical items; they
are tagged by dependency relation names from the set
$\mathcal{D}$ = {spec, subj, ppatt, ...}. 3 Co-occurrence restrictions
between lexical items are specified as sets of valencies that
express various constraints a head places on permitted 
modifiers. These constraints incorporate the following 
descriptive units: 

1. categorial: $\mathcal{C}$ = {WordActor, Noun, Substantive, Preposition, ...}
denotes the set of word classes, and $isa_{\mathcal{C}}$ =
{(Noun, WordActor), (Substantive, Noun), (Preposition,
WordActor), ...} $\subset \mathcal{C} \times \mathcal{C}$ denotes the subclass relation
yielding a hierarchical ordering in $\mathcal{C}$ (cf. also Fig. 1).

2. morphosyntactic: A unification formalism (similar in
spirit to Shieber, 1986) is used to represent morphosyntactic
regularities. It includes atomic terms from the set
$\mathcal{T}$ = {nom, acc, sg, pl, ...}, complex terms associating
labels from the set $\mathcal{L}$ = {case, num, agr, ...} $\cup\ \mathcal{D}$
with embedded terms, value disjunction (in curly
braces), and coreferences (numbers in angle brackets).
$\mathcal{U}$ denotes the set of allowed feature structures, $\nabla$ the
unification operation, $\bot$ the inconsistent element.
Given $u \in \mathcal{U}$ and $l \in \mathcal{L}$, the expansion $[l : u]$ denotes the
complex term containing only one label, $l$, with value
$u$. If $u$ is a complex term containing $l$ at top level, the
extraction $u \backslash l$ is defined to be the value of $l$ in $u$. By
definition, $u \backslash l$ yields $\bot$ in all other cases.

2 Although phrases are not explicitly represented (e.g., by non-lexical
categories), we consider each complete subtree of the dependency tree
a phrase (this convention allows discontinuous phrases as well). A
dependency is not treated as a relation between words (as in Word
Grammar; Hudson, 1990, p. 117), but between a word and a dependent
phrase (as in Dependency Unification Grammar; Hellwig, 1988). The
root of a phrase is taken to be the representative of the whole phrase.

3 Additionally, $\mathcal{D}$ contains the symbol self which denotes the currently
considered lexical item. This symbol occurs in feature structures (see
item 2) and in the ordering relations order and occurs (item 4).

3. conceptual: The concept hierarchy consists of a set of
concept names $\mathcal{F}$ = {Hardware, Computer, Notebook,
Harddisk, ...} and a subclass relation $isa_{\mathcal{F}}$ = {(Computer,
Hardware), (Notebook, Computer), ...} $\subset \mathcal{F} \times \mathcal{F}$.
The set of conceptual role names $\mathcal{R}$ = {HasPart,
HasPrice, ...} contains labels of possible conceptual
relations (a frame-style, classification-based knowledge
representation model in the spirit of MacGregor
(1991) is assumed). The relation $cic \subset \mathcal{F} \times \mathcal{R} \times \mathcal{F}$
implements conceptual integrity constraints: $(f, r, g) \in cic$ iff
any concept subsumed by $f \in \mathcal{F}$ may be modified by
any concept subsumed by $g \in \mathcal{F}$ in relation $r \in \mathcal{R}$, e.g.,
(Computer, HasPart, Harddisk) $\in cic$. From $cic$ the relation
$permit = \{(x, r, y) \in \mathcal{F} \times \mathcal{R} \times \mathcal{F} \mid \exists f, g \in \mathcal{F} : (f, r, g) \in cic \wedge x\ isa_{\mathcal{F}}^{*}\ f \wedge y\ isa_{\mathcal{F}}^{*}\ g\}$
(* denotes the transitive closure) can be derived which
explicitly states the range of concepts that can actually be
related. For brevity, we restrict this exposition to the
attribution of concepts and do not consider quantification,
etc. (cf. Creary & Pollard, 1985).

4. ordering: The (word-class specific) set $order \subset \mathcal{D}^{n}$
contains n-tuples which express ordering constraints
on the valencies of each word class. Legal orders of
modifiers must correspond to an element of order. The
(word specific) function $occurs : \mathcal{D} \to \mathbb{N}_0$ associates
dependency names with the modifier's (and self's) text
position (0 for valencies not yet occupied). Both specifications
appear at the lexical head only, since they
refer to the head and all of its modifiers.
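The subclass closure and the derived relation permit from the conceptual item can be illustrated with a small Python sketch. It is only an illustration under stated assumptions: the link (Harddisk, Hardware) is added for the example and is not given in the text, and the starred relation is taken to be reflexive as well as transitive, so that Computer itself counts as subsumed by Computer. The function names `closure` and `permit` are chosen here for illustration.

```python
from itertools import product

# Toy fragments of the relations from the text (direct subclass links only).
ISA_C = {("Noun", "WordActor"), ("Substantive", "Noun"), ("Preposition", "WordActor")}
# ("Harddisk", "Hardware") is an assumed link, not stated in the text.
ISA_F = {("Computer", "Hardware"), ("Notebook", "Computer"), ("Harddisk", "Hardware")}
CIC = {("Computer", "HasPart", "Harddisk")}


def closure(isa):
    """Reflexive-transitive closure of a finite subclass relation."""
    nodes = {x for pair in isa for x in pair}
    result = set(isa) | {(n, n) for n in nodes}
    changed = True
    while changed:
        changed = False
        # Iterate over a snapshot so the set can grow inside the loop.
        for (a, b), (c, d) in product(list(result), repeat=2):
            if b == c and (a, d) not in result:
                result.add((a, d))
                changed = True
    return result


def permit(cic, isa_f):
    """All (x, r, y) such that x and y are subsumed by some (f, r, g) in cic."""
    star = closure(isa_f) | {(c, c) for (f, _, g) in cic for c in (f, g)}
    return {(x, r, y)
            for (f, r, g) in cic
            for (x, f2) in star if f2 == f
            for (y, g2) in star if g2 == g}


PERMIT = permit(CIC, ISA_F)
```

With these toy relations, a Notebook may take a Harddisk as a part because Notebook is subsumed by Computer, while the more general concept Hardware is not licensed by the constraint.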

With these definitions, a valency can be characterized as
an element of the set $\mathcal{V} \subset \mathcal{D} \times \mathcal{C} \times \mathcal{U} \times 2^{\mathcal{R}}$. Focusing on one
dependency relation from the example "Compaq entwik- 
kelt einen Notebook mit einer 120-MByte-Harddisk" 
["Compaq develops a notebook with a 120-MByte hard 
disk"], the above criteria are illustrated in Table 1. The fea- 
ture structure of the two heads, "mit" and "Notebook", is 
given prior to and after the establishment of the depen- 
dency relation. The concepts of each of the phrases, 
120MB-HARDDISK-00004 and NOTEBOOK-00003, are 
stated. The order constraint of "Notebook" says that it may 
be preceded by a specifier (spec) and attributive adjectives 
(attr), and that it may be followed by prepositional phrases 
(ppatt). The valency for prepositional phrases described in 
the last row states which class, feature, and domain con- 
straints must be fulfilled by candidate modifiers. 

The predicate SATISFIES (cf. Table 2) holds when a 
candidate modifier fulfills the constraints stated in a specified
valency of a candidate head. If SATISFIES evaluates
to true, a dependency valency.name is established
(object.attribute denotes the value of the property attribute at
object). As can easily be verified, SATISFIES is fulfilled
for the combination of "mit", the prepositional valency, 
and "Notebook" from Table 1. 
TABLE 1. An illustration of grammatical specifications in the ParseTalk model



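$$
\begin{aligned}
&\mathrm{SATISFIES}(\mathit{modifier}, \mathit{valency}, \mathit{head}) :\Leftrightarrow \\
&\quad \mathit{modifier.class}\ \ isa_{\mathcal{C}}^{*}\ \ \mathit{valency.class} \\
&\quad \wedge\ (([\mathit{valency.name} : (\mathit{modifier.features} \backslash \mathit{self})]\ \nabla\ \mathit{valency.features})\ \nabla\ \mathit{head.features}) \neq \bot \\
&\quad \wedge\ \exists\, \mathit{role} \in \mathit{valency.domain} : (\mathit{head.concept}, \mathit{role}, \mathit{modifier.concept}) \in \mathit{permit} \\
&\quad \wedge\ \exists\, \langle d_1, \ldots, d_n \rangle \in \mathit{head.order} : \exists\, k \in \{1, \ldots, n\} : \\
&\quad\quad (\mathit{valency.name} = d_k) \\
&\quad\quad \wedge\ (\forall\, 1 \le i < k : (\mathit{head.occurs}(d_i) < \mathit{modifier.position})) \\
&\quad\quad \wedge\ (\forall\, k < i \le n : (\mathit{head.occurs}(d_i) = 0\ \vee\ \mathit{head.occurs}(d_i) > \mathit{modifier.position}))
\end{aligned}
$$
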

TABLE 2. The SATISFIES predicate 
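The four clauses of the predicate can be traced in a short Python sketch. It is a simplified illustration, not the authors' implementation: feature structures are plain nested dictionaries without disjunction or coreference, `None` stands for the inconsistent element, and the closure of the subclass relation and the permit relation are passed in as precomputed sets. All concrete values for "mit" and "Notebook" (features, positions, order tuple) are hypothetical, since Table 1 is not reproduced here.

```python
from dataclasses import dataclass, field


@dataclass
class Valency:
    name: str        # dependency relation name, e.g. "ppatt"
    cls: str         # word class required of the modifier
    features: dict   # morphosyntactic restrictions
    domain: set      # admissible conceptual roles


@dataclass
class Word:
    cls: str
    features: dict
    concept: str
    position: int
    order: list = field(default_factory=list)    # tuples of dependency names
    occurs: dict = field(default_factory=dict)   # name -> position, 0 if free


def unify(a, b):
    """Unify nested attribute-value maps; None stands for bottom."""
    if a is None or b is None:
        return None
    if isinstance(a, dict) and isinstance(b, dict):
        out = dict(a)
        for key, value in b.items():
            out[key] = unify(out[key], value) if key in out else value
            if out[key] is None:
                return None
        return out
    return a if a == b else None


def order_ok(head, name, pos):
    """Ordering clause: some order tuple places `name` consistently with pos."""
    for seq in head.order:
        for k, d_k in enumerate(seq):
            if d_k != name:
                continue
            occ = [head.occurs.get(d, 0) for d in seq]
            if (all(o < pos for o in occ[:k])
                    and all(o == 0 or o > pos for o in occ[k + 1:])):
                return True
    return False


def satisfies(modifier, valency, head, isa_star, permit):
    """Conjunction of the categorial, feature, conceptual and order clauses."""
    own = modifier.features.get("self")   # extraction modifier.features\self
    if own is None:
        return False
    feats = unify(unify({valency.name: own}, valency.features), head.features)
    return ((modifier.cls, valency.cls) in isa_star
            and feats is not None
            and any((head.concept, r, modifier.concept) in permit
                    for r in valency.domain)
            and order_ok(head, valency.name, modifier.position))


# Hypothetical values for the example sentence.
ISA_STAR = {("Preposition", "Preposition"), ("Preposition", "WordActor")}
PERMIT = {("Notebook", "HasPart", "Harddisk")}
ppatt = Valency("ppatt", "Preposition", {"ppatt": {"form": "mit"}}, {"HasPart"})
notebook = Word("Substantive", {"self": {"case": "acc"}}, "Notebook", 4,
                order=[("spec", "attr", "self", "ppatt")],
                occurs={"spec": 3, "attr": 0, "self": 4, "ppatt": 0})
mit = Word("Preposition", {"self": {"form": "mit"}}, "Harddisk", 5)
misplaced = Word("Preposition", {"self": {"form": "mit"}}, "Harddisk", 2)
```

Under these assumptions `satisfies(mit, ppatt, notebook, ISA_STAR, PERMIT)` holds, whereas the same preposition at text position 2 fails the ordering clause because it would precede the specifier.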

Note that unlike most previous dependency grammar for- 
malisms (Starosta & Nomura, 1986; Hellwig, 1988; Jap- 
pinen, Lassila & Lehtola, 1988; Fraser & Hudson, 1992) 
this criterion assigns equal opportunities to syntactic as 
well as conceptual conditions for computing valid depen- 
dency relations. Information on word classes, morphosyn- 
tactic features, and order constraints is purely syntactic, 
while conceptual compatibility introduces an additional 
description layer to be satisfied before a grammatical rela- 
tion may be established (cf. Muraki, Ichiyama & Fukumo- 
chi, 1985; Lesmo & Lombardo, 1992). Note that we 
restrict the scope of the unification module in our frame- 
work, as only morphosyntactic features are described 
using this subformalism. This contrasts sharply with stan- 
dard unification grammars (and with designs for depen- 
dency parsing as advocated by Hellwig (1988) and Lom- 
bardo (1992)), where virtually all information is encoded 
in terms of the unification formalism. 4
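The morphosyntactic subformalism described in Section 2.1 (unification, expansion, and extraction) can be sketched in Python as follows. This is a minimal sketch under stated assumptions: atomic terms are strings, complex terms are dictionaries, value disjunctions are frozensets of atoms, `BOTTOM` stands for the inconsistent element, and coreferences are left out. The function names are chosen for illustration only.

```python
BOTTOM = None  # stands for the inconsistent element


def as_set(term):
    """View an atom or a value disjunction uniformly as a set of atoms."""
    return term if isinstance(term, frozenset) else frozenset([term])


def unify(a, b):
    """Unify two feature structures; return BOTTOM on a clash."""
    if a is BOTTOM or b is BOTTOM:
        return BOTTOM
    if isinstance(a, dict) and isinstance(b, dict):
        out = dict(a)
        for label, value in b.items():
            if label in out:
                merged = unify(out[label], value)
                if merged is BOTTOM:
                    return BOTTOM
                out[label] = merged
            else:
                out[label] = value
        return out
    if isinstance(a, dict) or isinstance(b, dict):
        return BOTTOM  # complex term against atomic term
    common = as_set(a) & as_set(b)
    if not common:
        return BOTTOM
    return next(iter(common)) if len(common) == 1 else common


def expand(label, u):
    """[label : u], the complex term with the single label `label`."""
    return {label: u}


def extract(u, label):
    """Value of `label` at the top level of u, else BOTTOM."""
    if isinstance(u, dict) and label in u:
        return u[label]
    return BOTTOM


# A noun ambiguous between nominative and accusative meets an accusative demand.
noun = {"agr": {"case": frozenset({"nom", "acc"}), "num": "sg"}}
demand = {"agr": {"case": "acc"}}
resolved = unify(noun, demand)
```

Unifying the two structures narrows the case disjunction to the single value "acc", while a clash such as nominative against accusative yields the inconsistent element.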



2.1.1 A Look at Grammatical Hierarchies 

The grammatical specification of a lexical entry consists of 
structural criteria (valencies) and behavioral descriptions 
(protocols). In order to capture relevant generalizations 
and to support easy maintenance of grammar specifica- 
tions, both are represented in hierarchies (cf. Genthial, 
Courtin & Kowarski (1990) and Fraser & Hudson (1992) 
for inheritance that is restricted to structural criteria). The 
valency hierarchy assigns valencies to lexemes. We will 
not consider it in depth here, since it captures only tradi- 
tional grammatical notions, like transitivity or reflexivity. 
The organizing principle is the subset relation on valency 
sets. The word class hierarchy contains word class specifi- 
cations that cover distributional and behavioral properties. 
Fig. 1 illustrates the behavioral criterion by defining for 
each class different messages (the messages for Word- 
Actor are discussed in Sections 3 and 4). Within the Noun 
part of the word class hierarchy, there are different meth- 
ods for anaphora resolution reflecting different structural 
constraints on possible antecedents for nominal anaphora, 
reflexives and personal pronouns. The word class hierarchy
cannot be generated automatically, since classification
of program specifications (communication protocols, in
our case) falls outside the scope of state-of-the-art classifier
algorithms. On the other hand, the concept hierarchy is
based on the subsumption relation holding between concepts,
which is computed by a terminological classifier.
Most lexicon entries refer to a corresponding domain concept
and thus allow conceptual restrictions to be checked.

4 Typed unification formalisms (Emele & Zajac, 1990) would easily
allow for the integration of word class information. Ordering constraints
and conceptual restrictions (such as value range restrictions or
elaborated integrity constraints), however, are not so easily transferable,
because, e.g., the conceptual constraints go far beyond the level
of atomic semantic features still prevailing in unification formalisms.

FIGURE 1. Fragment of the word class hierarchy

2.2 The Actor Computation Model 

The actor model of computation combines object-oriented 
features with concurrency and distribution in a method- 
ologically clean way. It assumes a collection of indepen- 
dent objects, the actors, communicating via asynchronous 
message passing. An actor can send messages only to 
other actors it knows about, its acquaintances. The arrival 
of a message at an actor is called an event; it triggers the 
execution of a method that is composed of atomic actions,
viz. creation of new actors (create actorType (acquaintances)),
sending of messages to acquainted or newly created
actors (send actor message), or specification of new
acquaintances (become (acquaintances)). An actor system
is dynamic, since new actors can be created and the com- 
munication topology is reconfigurable. We assume actors 
that process a single message at a time, step by step 
(Hewitt & Atkinson, 1979). For convenience, we establish 
a synchronous request-reply protocol (Lieberman, 1987) 
to compute functions such as unification of feature struc- 
tures and queries to a (conceptual) knowledge base. In 
contrast to simple messages which unconditionally trigger 
the execution of a method at the receiving actor, we define 
complex word actor messages as full-fledged actors with 
independent computational abilities. Departure and arrival 
of complex messages are actions which are performed by 
the message itself, taking the sender and the target actors 
as parameters. Upon arrival, a complex message deter- 
mines whether a copy is forwarded to selected acquaintan- 
ces of its receiver and whether the receiver may process 
the message on its own (cf. Schacht, Hahn & Broker 
(1994) for a treatment of the parser's behavioral aspects). 
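The basic actor primitives described above (asynchronous send, become, and one message processed at a time) can be illustrated with a small Python sketch. It is a single-threaded approximation, not the ParseTalk runtime: a FIFO queue stands in for the asynchronous transport, complex messages are not modeled, and the `Counter` and `Printer` actors as well as the `meth_` naming convention are hypothetical.

```python
from collections import deque


class ActorSystem:
    """Single-threaded stand-in for an asynchronous message transport."""

    def __init__(self):
        self.queue = deque()   # pending (target, key, args) events

    def send(self, target, key, *args):
        self.queue.append((target, key, args))   # returns immediately

    def run(self):
        while self.queue:
            target, key, args = self.queue.popleft()
            target.receive(key, *args)            # one event at a time


class Actor:
    def __init__(self, system, **acquaintances):
        self.system = system
        self.acquaintances = dict(acquaintances)

    def become(self, **acquaintances):
        """Specify the acquaintances used for subsequent messages."""
        self.acquaintances = dict(acquaintances)

    def send(self, target, key, *args):
        self.system.send(target, key, *args)

    def receive(self, key, *args):
        getattr(self, "meth_" + key)(*args)       # dispatch on the message key


class Counter(Actor):
    def meth_add(self, amount, customer):
        self.become(total=self.acquaintances["total"] + amount)
        self.send(customer, "result", self.acquaintances["total"])


class Printer(Actor):
    def meth_result(self, value):
        self.become(log=self.acquaintances["log"] + [value])


system = ActorSystem()
printer = Printer(system, log=[])
counter = Counter(system, total=0)
system.send(counter, "add", 2, printer)
system.send(counter, "add", 3, printer)
system.run()
```

Because `send` only enqueues an event, the sender never waits for the receiver; the reply to `printer` is itself an ordinary message, which is one simple way to layer a request-reply pattern on top of asynchronous sends.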

The following syntax elements will be used subse- 
quently: a program contains actor definitions (declaring 
the acquaintances and defining the methods of actors 
instantiated from this definition) and actor message defini- 
tions (stating distribution and computation conditions). 
Method definitions contain the message key, the formal 
parameters and a composite action: 

actorDef  ::= defActor actorType (acquaintance*)
              methodDef*
methodDef ::= meth messageKey (param*) (action)
messDef   ::= defMsg messageType (acquaintance*)
              (((if condition distributeTo tag))*
               if condition compute
               ((if condition distributeTo tag))*)
action    ::= action; action
            | if condition (action) [ else (action) ]
            | send actor messageKey (param*)
            | become (acquaintance*)
            | create actorType (acquaintance*)
            | for var in set : (action)

condition is a locally computable predicate, written as
PREDICATE (actor*); actor stands for acquaintances, parameters,
newly created actors, the performing actor itself
(self) or the undefined value (nil); actor.acquaintance
yields the corresponding acquaintance of actor; for var in
set : (action) evaluates action for each element of set.

3 A SIMPLIFIED PROTOCOL FOR ESTABLISHING DEPENDENCY RELATIONS

The protocol described below allows dependency relations
to be established. It integrates structural restrictions on
dependency trees and provides for domesticated concurrency.

3.1 Synchronizing Actor Activities: Reception Protocol 

A reception protocol allows an actor to determine when all 
events (transitively) caused by a message have terminated. 
This is done by sending replies back to the initiator of the 
message. Since complex messages can be quasi-recur- 
sively forwarded, the number of replies cannot be deter- 
mined in advance. In addition, each actor receiving such a 
message may need an arbitrary amount of processing time 
to terminate the actions caused by the message (e.g., the 
establishment of a dependency relation requires communi- 
cation via messages that takes indeterminate time). There- 
fore, each actor receiving the message must reply to the 
initiator once it has terminated processing, informing the 
initiator to which actors the message has been forwarded. 

A message is a reception message if (1) the receiver is 
required to (asynchronously) reply to the initiator with a 
receipt message, and (2) the initiator queues a reception 
task. An (explicit) receipt message is a direct message con- 
taining a set of actor identities as a parameter. This set 
indicates to which actors the reception message has been 
forwarded or delegated. The enclosed set enables the 
receiver (which is the initiator of the reception message) to 
wait until all receipt messages have arrived. 5 In addition to
explicit receipts, which are messages solely used for termi- 
nation detection, there are regular messages that serve a 
similar purpose besides their primary function within the 
parsing process. They are called implicit receipt messages 
(one example is the headAccepted message described in 
Section 3.3). A reception task consists of a set of partial 
descriptions of the messages that must be received (im- 
plicit as well as explicit), and an action to be executed after 
all receipts have arrived (usually, sending a message). 



5 This, of course, only happens if the distribution is limited: The
searchHead message discussed below is only distributed to the head of each
receiver, which must occur in the same sentence. This ensures a finite
actor collection to distribute the message to, and guarantees that the
reception task is actually triggered.
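The termination detection performed by a reception task can be sketched in Python as follows. This is an illustration under assumptions, not the authors' specification: actors are identified by plain strings, each actor is assumed to send exactly one receipt per reception message, and the class and method names are hypothetical.

```python
class ReceptionTask:
    """Waits for receipts from every actor a reception message reached."""

    def __init__(self, first_targets, on_done):
        self.expected = set(first_targets)   # actors known to owe a receipt
        self.received = set()                # actors that have replied
        self.on_done = on_done               # action to run after all receipts
        self.done = False

    def receipt(self, sender, forwarded_to):
        """Record a receipt; `forwarded_to` names further recipients."""
        self.received.add(sender)
        self.expected |= set(forwarded_to)
        if not self.done and self.expected <= self.received:
            self.done = True
            self.on_done()


log = []
task = ReceptionTask({"w3"}, lambda: log.append("done"))
task.receipt("w2", set())        # arrives early, before w3 announced the forward
early = list(log)
task.receipt("w3", {"w2"})       # w3 reports that it forwarded to w2
```

The enclosed set of forwarded-to actors is what makes this work with receipts arriving in any order: an actor stays in the expected set until its own receipt arrives, and every forward is announced by the forwarder's receipt.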



defActor wordActor (head deps vals feats ...)        # head, dependencies, valencies, and features acquaintances
  meth searchHead (sender target init)               # processed at candidate heads (compute from the message definition)
    (for val in vals:                                # check all valencies of the possible head
      (if SATISFIES (init val self)                  # valency check adapted from Table 2
        (send (create headFound (self init val.name feats\val.name)) depart;   # reply to initiator, imposing restrictions
         become (head deps vals (feats ∇ init.feats) ...))                      # expand grammatical description of head
       else (send (create receipt (self init {head})) depart)))                 # send a receipt with the head the message was forwarded to
                                                     # depart realizes the departure of a complex message
  meth headFound (sender target name headFeats)      # processed at the initiator of a searchHead message
    (send (create headAccepted (self sender name)) depart;    # reply to head
     become (sender deps vals (feats ∇ headFeats) ...))       # store sender as head of self, restrict self's features
  meth headAccepted (modifier target name)           # processed at the head only
    ((for dep in deps:                               # check all dependencies
       (if (name = dep.name)                         # relation name is identical
         (send dep store (modifier))));              # send the dependency the message store to store the modifier
     send (create receipt (self modifier {head})) depart)     # send a receipt with the head the message was forwarded to

TABLE 3. Method definitions for searchHead, headFound, headAccepted

3.2 Encoding Structural Restrictions

Word actors conduct a bottom-up search for possible
heads; the principle of non-crossing arcs (projectivity of
the dependency tree) is guaranteed by the following forwarding
mechanism. Consider the case of a newly instantiated
word actor $w_n$ searching its head to the left (the opposite
direction is handled in a similar way). In order to guarantee
projectivity one has to ensure that only word actors
occupying the outer fringe of the dependency structure
(between the current absolute head $w_j$ and the rightmost
element $w_{n-1}$) receive the search message of $w_n$ (these are
circled in Fig. 2). 6 This forwarding scheme is reflected in
the following simplified message definition:

defMsg searchHead (sender target initiator)
  ((if GOVERNED (target) distributeTo head)
   # forward a copy to head, identified by head ∈ D
   if true compute)
   # the message is always processed at the target;
   # the computation event is concretized in the word
   # actor specification in Table 3

Thus, a message searching for a head of its initiator is
locally processed at each actor receiving it, and is forwarded
to the head of each receiver, if one already exists.

FIGURE 2. Forwarding a search message

Additionally, direct messages are used to establish a dependency
relation. They involve no forwarding and may
be specified as follows:

defMsg <directMessage> (sender target ...)
  (if true compute)
  # a direct message is always processed at the
  # target, no distribution condition can apply

Below, a number of messages of this type are used for
negotiating dependencies, e.g., headFound, headAccepted,
receipt (each with different parameters, as represented
by "..." above).

6 Additionally, $w_n$ may be governed by any word actor governing $w_j$, but
due to the synchronization implemented by the receipt protocol, each
head of $w_j$ must be located to the right of $w_n$.
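The forwarding scheme of Section 3.2 amounts to walking the chain of heads upward from the rightmost word. The Python sketch below illustrates this under assumptions: words are identified by their text positions, the head of each word is kept in a dictionary, and the partial analysis of "Compaq entwickelt einen Notebook" (with "Compaq" and "Notebook" attached to "entwickelt", and "einen" attached to "Notebook") is assumed for the example.

```python
def search_head_recipients(heads, start):
    """Actors reached by searchHead: `start`, then each head in turn."""
    recipients = []
    current = start
    while current is not None:
        recipients.append(current)
        current = heads.get(current)   # GOVERNED(target): distribute to head
    return recipients


# Positions: 1 Compaq, 2 entwickelt, 3 einen, 4 Notebook; 5 "mit" is new.
heads = {1: 2, 2: None, 3: 4, 4: 2}
reached = search_head_recipients(heads, start=4)
```

Only "Notebook" and "entwickelt" receive the search message of "mit". The inner words "Compaq" and "einen" are never reached, so no attachment of "mit" can cross an existing arc.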

3.3 An Excerpt from the Word Actor Script 

The protocol for bottom-up establishment of dependencies 
consists of three steps: The search for a head (search- 
Head), the reply of a suitable head to the initiator of the 
search (headFound), and the acceptance by the initiator 
(headAccepted), thereby becoming a modifier of the 
head. The corresponding method definitions are given in 
Table 3 (note that these methods are defined for one actor 
type here, but are executed by different actors during pars- 
ing). The protocol allows alternative attachments to be 
checked concurrently, since each actor receiving search- 
Head may process it locally, while the message is simulta- 
neously distributed to its head. 

The specification of methods as above gives a local 
view of an actor system, stating how each actor behaves 
when it receives a message. For a global view taking the 
actors' interaction patterns into account, cf. Schacht, Hahn 
& Broker (1994). 
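The three-step negotiation can be traced with a small sequential simulation in Python. This is a simplified sketch, not the word actor script itself: a FIFO queue replaces asynchronous delivery, the SATISFIES check is replaced by a hypothetical lookup table from modifier names to relation names, receipt messages are omitted, and a second headFound message is simply ignored rather than treated as an ambiguity.

```python
from collections import deque


class WordActor:
    def __init__(self, name, head=None, valencies=None):
        self.name = name
        self.head = head
        self.valencies = valencies or {}   # modifier name -> relation (stand-in for SATISFIES)
        self.deps = []                     # (relation, modifier name) pairs


def negotiate(initiator, first_target):
    """Run searchHead, headFound, headAccepted and return the event trace."""
    queue = deque([("searchHead", first_target, initiator, None)])
    trace = []
    while queue:
        key, target, other, rel = queue.popleft()
        trace.append((key, target.name))
        if key == "searchHead":
            if target.head is not None:                 # forward a copy to the head
                queue.append(("searchHead", target.head, other, None))
            if other.name in target.valencies:          # candidate head replies
                queue.append(("headFound", other, target,
                              target.valencies[other.name]))
        elif key == "headFound":                        # processed at the initiator
            if target.head is None:
                target.head = other
                queue.append(("headAccepted", other, target, rel))
        elif key == "headAccepted":                     # processed at the head
            target.deps.append((rel, other.name))
    return trace


entwickelt = WordActor("entwickelt")
notebook = WordActor("Notebook", head=entwickelt, valencies={"mit": "ppatt"})
mit = WordActor("mit")
trace = negotiate(mit, notebook)
```

In the trace, the copy of searchHead forwarded to "entwickelt" is handled independently of the reply from "Notebook", which mirrors the way alternative attachments are checked side by side in the protocol.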

4 AMBIGUITY HANDLING 

There are two alternative processing strategies for ambigu- 
ities, viz. serial vs. parallel processing. We here focus on a 
parallel mode, specifying only necessary serializations. 
Whenever an ambiguity is detected, additional actors are 
created to represent different readings. The standard three-step
negotiation scheme for dependencies can easily be
adapted to this duplication process. When a word actor
receives the second (or n-th) headFound message it
does not immediately reply with a headAccepted mes- 
sage, but initiates the copying of itself, its modifiers, and 
the prospective head (which, in turn, initiates copying its 
modifiers and head, if any). Copying modifiers proceeds 
by sending a copyStructure message to each actor in- 
volved, which evokes a (standard) headAccepted mes- 
sage returned by the actor copy. Copying the head is done 
via a duplicateStructure message, which will result in an- 
other headFound message to be returned. Since this 
headFound message is addressed to the ungoverned copy, 
the copy may reply as usual by sending a headAccepted 
message. Duplication of actors allows the concurrent pro- 
cessing of alternatives, and requires only limited overhead 
for the distribution of messages among duplicated actors. 



4.1 Packing Ambiguities 

Usually, a packed representation of ambiguous structures 
is preferred in the parsing literature (Tamura et al., 1991). 
This is feasible when syntactic analysis is the only deter- 
mining factor for the distribution of partial structures. But 
if conceptual knowledge is taken into account, the distri- 
bution of a phrase is not fully determined by its syntactic 
structure. Possible conceptual relations equally influence 
the distribution of the phrase. Additionally, the inclusion 
of an ambiguous phrase in a larger syntactic context 
requires the modification of the conceptual counterparts. 
In a packed representation, there would have to be several 
conceptual counterparts, i.e., only the syntactic representa- 
tion can be packed (and it might even be necessary to 
unpack it on-the-fly). Consequently, whenever conceptual 
analysis is integrated into the parsing process (as opposed 
to its interpretation in a later stage, thereby producing 
numerous ambiguities in the syntactic analysis), structure 
sharing is impossible, since different syntactic attachments 
result in different conceptual analyses, and no common 
structure is accessible that can be shared (cf. Akasaka 
(1991) for a similar argument). We expect that the over- 
head of duplication is compensated for by the ambiguity- 
reducing effects of integrating several knowledge sources. 

4.2 Relation to Psycholinguistic Performance Models 

It has been claimed that human language understanding 
proceeds in a more sequential mode, choosing one alterna- 
tive and backtracking if that path fails (e.g., Hemforth, 
Konieczny & Strube, 1993). This model requires the ranking
of all alternatives according to criteria referring to
syntactic or conceptual knowledge. The protocol outlined so
far could easily be adapted to this processing strategy:
All headFound messages must be collected, and the
corresponding attachments ranked. The best attachment is 
selected, and only one headAccepted message sent. In 
case the analysis fails, the next-best attachment would be 
tried, until an analysis is found or no alternatives are left. 
Additionally, the dependencies established during a failed 
path would have to be released. 7 
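The serial strategy just described (collect, rank, try the best attachment, fall back on failure) can be sketched in Python. The ranking function and the success test are hypothetical placeholders, since no concrete ranking criteria are defined here; the sketch only shows the control flow.

```python
def attach_serially(head_found, rank, continue_analysis):
    """Try collected attachments best-first; return the first that succeeds."""
    tried = []
    for head in sorted(head_found, key=rank, reverse=True):
        tried.append(head)               # the single headAccepted goes to this head
        if continue_analysis(head):
            return head, tried
        # on failure, the dependency would be released before the next attempt
    return None, tried


# Hypothetical scores and a hypothetical outcome of the remaining analysis.
scores = {"Notebook": 2, "entwickelt": 1}
chosen, tried = attach_serially(
    ["entwickelt", "Notebook"],
    rank=lambda head: scores[head],
    continue_analysis=lambda head: head == "entwickelt",
)
```

Here the preferred attachment to "Notebook" is tried first and abandoned, after which the next-best candidate is accepted.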

7 Note that all psycholinguistic studies we know of refer to a
constituency-based grammar model. Since our grammar is based on
dependency relations, principles such as Minimal Attachment cannot
be transferred without profound modification, since in a dependency
tree the number of nodes is identical for all readings. Therefore,
principles adapted to the structural properties of dependency trees must be
formulated for preferential ranking.

5 COMPARISON TO RELATED WORK

The issue of object-oriented parsing and concurrency (for
a survey, cf. Hahn & Adriaens, 1994) has long been considered
from a purely implementational perspective. Message
passing as an explicit control mechanism is inherent
to various object-oriented implementations of standard
rule-based parsers (cf. Yonezawa & Ohsawa (1988) for
context-free and Phillips (1984) for augmented PSGs).
Actor-based implementations are provided by Uehara et
al. (1985) for LFGs and Abney & Cole (1986) for GB
grammars. Similarly, a parallel implementation of a rule-based,
syntax-oriented dependency parser has been
described by Akasaka (1991). The consideration of concurrency
at the grammar specification level has recently
been investigated by Milward (1992) who properly relates
notions from categorial and dependency grammar with a
state logic approach, a formal alternative to the event-algebraic
formalization underlying the ParseTalk model.

Almost all of these proposals lack a serious account of
the integration of syntactic knowledge with conceptual
knowledge (cf. the end of Section 2.1 for similar considerations
related to dependency grammars). The development
of conceptual parsers (Riesbeck & Schank, 1978), how- 
ever, was entirely dominated by conceptual expectations 
driving the parsing process and specifically provided no 
mechanisms to integrate linguistic knowledge into such a 
lexical parser in a systematic way. The pseudo-parallelism 
inherent to these early proposals, word expert parsing in 
particular (Small & Rieger, 1982), has in the meantime 
been replaced by true parallelism, either using parallel 
logic programming environments (Devos, Adriaens & 
Willems, 1988), actor specifications (Hahn, 1989) or a 
connectionist methodology (Riesbeck & Martin, 1986), 
while the lack of linguistic sophistication has remained. 

A word of caution should be expressed regarding the 
superficial similarity between object-oriented and connec- 
tionist models. Connectionist methodology (cf. a survey 
by Selman (1989) of some now classical connectionist nat- 
ural language parsing systems) is restricted in two ways 
compared with object-oriented computing. First, its com- 
munication patterns are determined by the hard-wired 
topology of connectionist networks, whereas in object-ori- 
ented systems the topology is flexible and reconfigurable. 
Second, the type and amount of data that can be exchanged 
in a connectionist network is restricted to marker and 
value passing together with severely limited computation 
logic (and-ing, or-ing of Boolean bit markers, determining 
maximum/minimum values, etc.), while none of these re- 
strictions apply to message passing models. These consid- 
erations equally extend to spreading activation models of 
natural language parsing (Charniak, 1986; Hirst, 1987) 
which are not as constrained as connectionist models but 
less expressive than general message passing models 
underlying the object-oriented paradigm. As should be 
evident from the preceding exposition of the ParseTalk 
model, the complexity of the data exchanged and computations
performed, in our case, requires a full-fledged
message-passing model.

6 CONCLUSIONS 

The ParseTalk model of natural language understanding 
aims at the integration of a lexically distributed, depen- 
dency-based grammar specification with a solid formal 
foundation for concurrent, object-oriented parsing (cf. 
Hahn, Schacht & Broker (forthcoming) for a more elaborate
presentation). It conceives communication among
and within different knowledge sources (grammar, domain 
and discourse knowledge) as the backbone for complex 
language understanding tasks. The main specification elements
of the grammar model consist of categorial, morphosyntactic,
conceptual, and ordering constraints in terms
of valency specifications attached to single lexical items. 
The associated concurrent computation model is based on 
the actor paradigm of object-oriented programming. The 
ParseTalk model has been experimentally validated by a 
prototype system, a parser for German (for its implementa- 
tional status, cf. Schacht, Hahn & Broker, 1994). 